Statistical models for predicting number of involved nodes in breast cancer patients.

Authors

  • Alok Kumar Dwivedi
  • Sada Nand Dwivedi
  • Suryanarayana Deo
  • Rakesh Shukla
  • Elizabeth Kopras
Abstract

Clinicians need to predict the number of involved nodes in breast cancer patients in order to ascertain severity and prognosis and to design subsequent treatment. The distribution of involved nodes often displays over-dispersion, that is, larger variability than expected. Until now, the negative binomial model has been used to describe this distribution, assuming that over-dispersion is due only to unobserved heterogeneity. However, the distribution of involved nodes contains a large proportion of excess zeros (negative nodes), which can also lead to over-dispersion. In this situation, alternative models may better account for over-dispersion due to excess zeros. This study examines data from 1152 patients who underwent axillary dissections in a tertiary hospital in India between January 1993 and January 2005. We fit and compare various count models to test their ability to predict the number of involved nodes. We also argue for using zero inflated models in populations where all the excess zeros come from those who are at some risk of the outcome of interest. The negative binomial regression model fits the data better than the Poisson and zero hurdle/inflated Poisson regression models. However, zero hurdle/inflated negative binomial regression models predicted the number of involved nodes much more accurately than the negative binomial model. This suggests that the number of involved nodes displays excess variability not only due to unobserved heterogeneity but also due to excess negative nodes in the data set. In this analysis, only skin changes and primary site were associated with negative nodes, whereas parity, skin changes, primary site and size of tumor were associated with a greater number of involved nodes. When performance is nearly equal, the zero inflated negative binomial model should be preferred over the hurdle model in describing the nodal frequency because it provides an estimate of negative nodes that are at "high-risk" of nodal involvement.
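The difference between the count models named in the abstract can be illustrated with a minimal sketch of their probability mass functions. This is not the authors' analysis: the function names are hypothetical, the negative binomial is assumed to use the common mean/dispersion form (variance = mu + alpha * mu^2), and the parameter values are made up for illustration rather than estimated from the study data.

```python
import math


def negbin_pmf(k, mu, alpha):
    """Negative binomial pmf with mean mu and dispersion alpha (assumed NB2 form)."""
    r = 1.0 / alpha            # "size" parameter
    p = r / (r + mu)           # success probability
    log_pmf = (math.lgamma(k + r) - math.lgamma(k + 1) - math.lgamma(r)
               + r * math.log(p) + k * math.log(1.0 - p))
    return math.exp(log_pmf)


def zinb_pmf(k, mu, alpha, pi_zero):
    """Zero inflated NB: a structural zero with probability pi_zero, else an NB count.

    Zeros can therefore come from two sources: the inflation part and the
    count part (patients at risk whose count happens to be zero).
    """
    base = (1.0 - pi_zero) * negbin_pmf(k, mu, alpha)
    return pi_zero + base if k == 0 else base


def hurdle_negbin_pmf(k, mu, alpha, p_zero):
    """Hurdle NB: all zeros come from one binary part; positives are zero-truncated NB."""
    if k == 0:
        return p_zero
    truncated = negbin_pmf(k, mu, alpha) / (1.0 - negbin_pmf(0, mu, alpha))
    return (1.0 - p_zero) * truncated


def at_risk_share_of_zeros(mu, alpha, pi_zero):
    """Share of all zeros in a ZINB model that come from the count (at-risk) part."""
    from_count_part = (1.0 - pi_zero) * negbin_pmf(0, mu, alpha)
    return from_count_part / zinb_pmf(0, mu, alpha, pi_zero)


# Hypothetical illustration values, NOT estimates from the study.
MU, ALPHA, PI_ZERO = 4.0, 1.5, 0.3

if __name__ == "__main__":
    print("P(0) under NB:  ", negbin_pmf(0, MU, ALPHA))
    print("P(0) under ZINB:", zinb_pmf(0, MU, ALPHA, PI_ZERO))
    print("At-risk share of zeros (ZINB):", at_risk_share_of_zeros(MU, ALPHA, PI_ZERO))
```

In this sketch the hurdle model treats every zero alike, while the zero inflated model splits zeros into structural zeros and zeros from the count part. That split is what allows a quantity like `at_risk_share_of_zeros` to be computed, which mirrors the abstract's reason for preferring the zero inflated negative binomial model when fit is similar.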

Similar resources

Comparison of Random Survival Forests for Competing Risks and Regression Models in Determining Mortality Risk Factors in Breast Cancer Patients in Mahdieh Center, Hamedan, Iran

Introduction: Breast cancer is one of the most common cancers among women worldwide. Patients with cancer may die due to disease progression or other types of events. These different event types are called competing risks. This study aimed to determine the factors affecting the survival of patients with breast cancer using three different approaches: cause-specific hazards regression, subdistri...

The Relationship of Gravidity with the Frequency of Removed Lymph Nodes in Mastectomy and Involved Lymph Nodes after the Surgery in Women with Breast Cancer

Abstract Background: Breast cancer is the most common fatal cancer among women worldwide and it has an increasing rate in Iranian women. The aim of this study was to determine the relationship of gravidity with removed lymph nodes and involved lymph nodes after mastectomy surgery in women with breast cancer. Methods: In this ...

Predicting the Incidence and Trend of Breast Cancer Using Time Series Analysis for 2007-2016 in Qazvin

Introduction: Breast cancer is the most common cancer and the second leading cause of death in women worldwide. The aim of this study was to analyze the trend and predict the incidence of breast cancer using time series analysis. Methods: In this study, data on breast cancer incidence in Qazvin province between 2007 and 2016 were analyzed using time series analysis with autoregressive integrate...

Investigating Factors Affecting Time to Recurrence of Breast Cancer Using the Cox Model

Background and purpose: The aim of this study was to estimate the disease-free survival rate in female patients with breast cancer and to determine the level of influencing factors. Materials and methods: In a retrospective cohort study, the records of 377 patients attending Mashhad Omid hospital, Iran, spanning the years 2006 to 2011 were selected using convenience sampling. The patients we...

Predicting Survival of Breast Cancer Patients Using Two Models: Logistic Regression and Artificial Neural Network

Background and Objectives: In recent years, considerable attention has been paid to statistical models for classification of medical data according to various diseases and their outcomes. Artificial neural networks have been successfully used for pattern recognition and prediction since they are not based on prior assumptions in clinical studies. This study compared two statistical models, arti...

Investigating the Relationship Between the HER2 Proto-Oncogene and Prognostic Factors of Breast Cancer

Background: Breast cancer is the most common female malignancy worldwide, including in Iran, and is the second leading cause of death due to malignancy after lung cancer. A variety of factors, such as estrogen and progesterone receptors and axillary lymph node involvement, may influence the prognosis and therapeutic approach. However, mutation in the HER2 gene may also affect the prognosis. In this study...


Journal title:
  • Health

Volume 2 7  Issue 

Pages  -

Publication date 2010